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Experimental evidences show that in gene transcription, RNA polymerase has the possibility to be 
stalled at certain position of the transcription template. This may be due to the template damage, 
or protein barriers. Once stalled, polymerase may backtrack along the template to the previous 
nucleotide to wait for the repair of the damaged site, or simply bypass the barrier or damaged site 
and consequently synthesize an incorrect messenger RNA, or degrade and detach from the tem¬ 
plate. Thus, the effective transcription rate (the rate to synthesize correct product mRNA) and the 
transcription effectiveness (the ratio of the effective transcription rate to the effective transcription 
initiation rate) are both influenced by polymerase stalling events. So far, no theoretical model has 
been given to discuss the gene transcription process including polymerase stalling. In this study, 
based on the totally asymmetric simple exclusion process (TASEP), the transcription process includ¬ 
ing polymerase stalling is analyzed theoretically. The dependence of the effective transcription rate, 
effective transcription initiation rate, and transcription effectiveness on the transcription initiation 
rate, termination rate, as well as the backtracking rate, bypass rate, and detachment (degradation) 
rate when stalling, will be discussed in detail. The results showed that backtracking restart after 
polymerase stalling is an ideal mechanism to increase both the effective transcription rate and tran¬ 
scription effectiveness. Without backtracking, detachment of stalled polymerase can also help to 
increase the effective transcription rate and transcription effectiveness. Generally, the increase of 
bypass rate of the stalled polymerase will lead to the decrease of the effective transcription rate and 
transcription effectiveness. But when both detachment rate and backtracking rate of the stalled 
polymerase vanish, the effective transcription rate may also be Increased by the bypass mechanism. 


I. INTRODUCTION 

Replication, transcription, and translation are three basic processes in cells. Before cell division, a cell replicates 
its DNA with the help of DNA polymerase. Using DNA as a template, messenger RNA (mRNA) is synthesized 
by RNA polymerase (RNAP) during the so-called transcription process. Then using mRNA as a template, peptide 
chain is synthesized by ribosome during the translation process, and proteins are then obtained by the folding of 
peptide chains. Roughly speaking, each of the three processes includes three subprocesses, initiation, elongation, and 
termination. The product is synthesized by polymerase during its forward motion along template in the elongation 
process. 
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In the field of theoretical studies, transcription process is usually described by the totally asymmetric simple 
exclusion processes (TASEP), see [1-6]. In which RNAP is regarded as a point particle, and the template DNA is 
regarded as a one-dimensional lattice with lattice sites corresponding to the nucleotides in DNA. The transcription 
initiation corresponds to the binding of RNAP to the first lattice site, where the first site can be regarded as a 
combination of the promoter and the transcription start site. The transcription termination corresponds to the 
leaving of particle from the last site of the lattice. The elongation of transcription is described by the forward hopping 
of particle in the main body of the lattice. The totally asymmetric exclusion means that the polymerase at site 
i can only hopping forward to site i + \ provided the site i -I- 1 is not occupied. In TASEP, the forward hopping 
rates of particle at any site i of the lattice are always assumed to be the same and simply normalized to be 1. It 
implies that RNAP will move along DNA template with constant speed until the termination site. However, several 
experimental observations found that the regular elongation procedure may be interrupted, with RNAP stalled at 
certain nucleotide. The stalling of RNAP may be caused by several reasons. Structural aberrations of the template 
can trigger a stalling of polymerase [7-9] . Polymerase may also be stalled from the depletion of building blocks NTP 
[10], or from the template damage [11-16]. Meanwhile, the damage or incorrect assembling of polymerase itself may 
also lead to stalling [17]. 

In both prokaryotic and eukaryotic cells, there are several mechanisms which are usually employed by polymerase 
to solve the stalling problem. If the stalling is caused by template damage, polymerase may backtrack along the 
template to the previous site and wait for the repair of the damaged site [18-25]. The synthesis of mRNA is able 
to restart after the repair. Alternatively, the stalled polymerase may simply bypass the damaged site and continue 
the transcription process from the downstream site, and finally end the transcription at the termination site with an 
incorrect product [16, 19-21, 26, 27]. Meanwhile, if a prolonged stalling occurs, the polymerase may be degraded 
as a mechanism of last resort [28]. By the way, in translation process, recent experiments have also found that the 
template (mRNA) can degrade when the translocation of ribosome is stalled [29-32] . 

The polymerase stalling as well as the possible mechanisms employed by the stalled polymerase will affect the 
overall transcription rate and efficiency, and consequently have influence on the strength of gene expression. Thus, 
the related properties of transcription are not only determined by the initiation rate and termination rate as implied 
in the usual TASEP model, but also influenced by the polymerase stalling and corresponding mechanisms used to 
overcome the stalling problem. Although there are various kinds of generalizations of TASEP model, no one can be 
used directly to describe the gene transcription process with polymerase stalling. 

In this study, a modified TASEP model is presented to describe the gene transcription process including polymerase 
stalling. For simplicity, this study assumes that there is only one nucleotide in the transcription template at which 
polymerase may be stalled, and the position of this nucleotide is unchanged for any polymerase. This nucleotide 
may be damaged, or bound by protein complexes, or there is one special secondary structure around it. The stalled 
polymerase may backtrack along template to the previous binding site to wait for the repair of the damaged site (or 
clearance of the barrier), or simply bypass this nucleotide and synthesize an incorrect mRNA product, or degrade 
and detach from the transcription template, see Fig. 1. The numerical calculations of our modified TASEP model 
show that, the effective transcription rate may be enlarged by increasing the backtracking rate, detachment rate, and 
bypass rate of the stalled polymerase. Even the transcription effectiveness may be increased with the backtracking 
and detachment rates. Generally, backtracking is one ideal mechanism to solve the polymerase stalling problem. 
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FIG. 1: (a) Modified TASEP model to describe gene transcription process with possible polymerase stalling at site 1. 

Transcription starts with polymerase binding to the first site 0 (with the rate denoted by a), and terminated at the last site 
N with rate denoted by /3. At site I, the forward translocation of polymerase may be stalled. There are three mechanisms for 
a stalled polymerase to leave the damaged site I, backtracking to site I — 1 with rate kb, degrading and detaching from the 
template with rate kd, or just bypassing the site I with rate kbp and continuing its translocation along the template (but the 
inRNA synthesized by it is incorrect and will degrade soon). During transcription elongation period, the forward stepping rate 
of polymerase is denoted hy ks, which is assumed to be the same throughout the transcription, for polymerases in whatever 
states (correct or incorrect), (b) Notations for probabilities of finding polymerase at corresponding states, with correctly 
transcribed mRNA (pi), incorrectly transcribed mRNA (qi), and backtracked polymerase at site I — 1 (c) Notations for 

probabilities related to site I, which may be damaged with or without polymerase binding. 


Without backtracking, detachment and bypass are also good mechanisms to increase the effective transcription rate. 

This study is organized as follows. The modified TASEP model describing the transcription process including 
polymerase stalling will be presented in the next section, and then the results obtained by this model will be given in 
Section III. Finally, concluding remarks will be presented in the last section. 


II. MODIFIED TASEP MODEL FOR GENE TRANSCRIPTION WITH POLYMERASE STALLING 

The model used in this study can be regarded as a modification of the usual TASEP, which is schematically depicted 
in Fig. 1. Where the length of gene is assumed to be A^+1, and each lattice site stands for one nucleotide or nucleotide 
group (which means this model is obtained by coarse grain). The transcription begins with RNA polymerase binding 
to lattice site 0 (corresponding to the promoter upstream the gene) with rate a, which depends on the concentration 
of free polymerase in environment, the binding rate of transcription factors, and the nucleotide sequence of promoter 
[33]. The transcription is ended by polymerase leaving from lattice site N, with corresponding rate denoted by ff 
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This study assumes that only lattice site I may be damaged (or occupied by a protein complex). The rate constant 
that site I becomes damaged is denoted by kpd- If the damaged site I is not occupied by a polymerase, then it can be 
repaired with rate kpr- See the following Eq. (3) for the dynamics of probability that site I is damaged. If the site / is 
damaged, polymerase on it may bypass it directly (with no transcription) and continue its forward translocation along 
template, and finally leave from the stop site N, but the product (i.e. mRNA) synthesized by it will be incorrect and 
will degrade soon. In this study, the probability that there is a polymerase with a correct semi-finished product at 
site i (for 0 < i < N) is denoted by pi, and the probability that there is a polymerase with an incorrect semi-finished 
product at site i (for / -|- 1 < z < iV) is denoted by A polymerase at site i (for i ^ 1) will move to site z -I- 1 with 
rate ks provided site z -I- 1 is unoccupied. 

If the site I is damaged, polymerase at this site will be stalled. Experiments found that there are three possible 
mechanisms for the stalled polymerase to leave the damaged site 1. ( 1 ) The polymerase may backtrack to the previous 
site I — 1 with rate fcf, provided the site 1 — 1 is not occupied. After the repair of site I, the backtracked polymerase 
will return to site I with rate kf. The probability of finding a backtracked polymerase at site 1 — 1 is denoted by 
p'i_i- See the following Eq. (5) for the dynamics of probability ( 2 ) The polymerase may bypass the damaged 

site I with rate k^p, and continue its transcription from the downstream site ^ -|- 1. ( 3 ) The polymerase may degrade 
with rate kd and detach from the transcription template. Note that the genetic information coded in damaged site 
I cannot be transcribed. Therefore, the mRNA synthesized by a polymerase which has bypassed the damaged site I 
is nonfunctional and will degrade soon. Meanwhile, the damaged site I cannot be repaired if there is a polymerase 
binding on it. Thus, if kb = 0, kbp = 0, and kd = 0, the polymerase will be stalled at the damaged site I forever. So 
bypass, degradation, and backtracking are three important mechanisms for cells to continue the transcription process. 
Otherwise, the template will be totally blocked and become useless, and should be degraded. 

This study assumes that each site z can only be occupied by one polymerase. If there is one backtracked polymerase 
at site I — 1, the site I will be unoccupied. This is because that the backtracked polymerase at site / — 1 is from site 
1. In the following, the probability of finding a polymerase at damaged site I is denoted by p. For the model depicted 
in Fig. 1, the probabilities pi are governed by the following equations 


dpo/dt = a{l - po) - kEPoi^ - Pi), 

dpjdt = kEPi-i{l - Pi) - kEPi{l - Pi+i), for 1 < z < ; - 3, 
dpi^2/dt = kEPi-'i{l-pi-2)-kEPi-2{^-pi-i-p'i_i)j 
dpi^i/dt = kEPi-2{l-pi-i-p'i-i)-kEPi-i{l-pi), 

dpi/dt = kEPi-i{l-pi)-kE{pi-p){l-pi+i-qi+i) + kf{l-p)p[_^ (I) 

-kbp{l - pi-i - p'i_^) - kbpp{l - pi+i - qi+i) - kdp, 
dpi+i/dt = kEipi - p){l - Pi+i - qi+i) - kEPi+i{l - Pi+2 - qi+2), 

dpi/dt = kEPi-i{l - Pi - qi) - kEPiil - Pi+i - qi+i), ior I + 2 < i < N - I, 

dpEf/dt = kEPN-i{^-PN - qN) - Ppn- 

Where the equations for 0 < z < Z — 3 and / -|- 2 < z < can be obtained similarly as in usual TASEP model. The 

total probability of finding polymerase at site I — 1 is pi-i + p'i_i, where pi-i is the probability of polymerase which 

comes from site I — 2, and p'i_i is the probability of polymerase which is backtracked to site I — 1 from the damaged 
site 1. The probability flux from site 1 — 2 to site I — 1, which is related to the governing equations of probabilities 
pi -2 and pi-i, is kEPi- 2 i^ — Pi-i — p'l-i)- In the governing equation of probability pi, the first term is the flux from 
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site Z — 1 to site 1. The second term is the flux from undamaged site I to site / + 1, where pi — p is the probability 
that there is a polymerase at site I and the site I is not damaged. The third term is the return flux from site / — 1 to 
site I of the backtracked polymerase, where 1 — p is the probabilty that the damaged site / has been repaired. The 
forth term is the backtraking flux. The fifth term is the bypass flux, and the final term is the detachment flux. The 
governing equation for probability p;+i can be obtained similarly. 

Meanwhile, the probabilities qi satisfy (see Fig. 1(b) for the meanings of probabilities qi) 

dqi+i/dt = kbpp{l-pi+i-qi+i)-kEqi+i{l-pi+ 2 -qi+ 2 ), 

dqi/dt = kEqi-i{l-Pi-qi)-kEqi{l-Pt+i-qi+i), for Z + 2 < i < iV - 1, (2) 

dq^/dt = ksqN-iii - Pn - qN) - l3qN- 

Where the probability p that site I is damaged satisfies 

dp/dt = kpd{l - p) - kpr{p - p). (3) 

In which, the second term is from the assumption that only the unoccupied site I can be repaired. The probability p 
that there is a polymerase at the damaged site I can be obtained as follows 

dp/dt = kpd{pi - p) + kEPi-iip - p) - fcbpP(l - qi+i -Pi+i) - kdP - kbp[l -p'l^i -Pi-i)- (4) 

Where the first term is the flux of probability that the occupied site I becomes damaged. The second term is the flux 
of probability that a polymerase translocates from site Z — 1 to the unoccupied but damaged site 1. The last three 
terms are bypass flux, detachment flux, and backtracking flux, respectively. Finally, the probability p[_i that there 
is a backtracked polymerase at site I — 1 satisfies 

dp[_i/dt = kbp{l -pi-i -p'l-i) - kf{l-p)p'i_^. (5) 

Where the first term is the backtracking probability flux of the stalled polymerase from the damaged site I to its 
upstream site Z — 1, and the second term is the return probability flux of the backtracked polymerase. For convenience, 
meanings of probabilities pi,p, p are displayed in Fig. 1(c). The total probability of finding a polymerase at site i, no 
matter whether it is with a correctly synthesized mRNA or an incorrect mRNA, is denoted by Pi, or mathematically, 

{ Pi, for 0 < Z < Z — 2 or i = Z, 

pi-i+p'i_i, forZ = Z-l, (6) 

Pi + qi, ioT I + 1 < i < N. 


III. RESULTS 


All results of this study are based on the steady state solution of Eqs. (1-5), which are obtained by numerical 
calculations performed in software MATLAB. To illustrate the properties of gene transcription with possible stalling 
of polymerase at a given position, typical examples of related probabilities, obtained by the modified TASEP model, 
are plotted in Eig. 2. Where Figs. 2(a-d) are for the cases where kb = kd = 0, i.e., the backtracking rate to the 
upstream site Z — 1 and detachment rate from site Z for stalled polymerase at site Z vanish. For these special cases, the 
total probability Pi of finding polymerase at site i may have three different phases, low density phase (Figs. 2(a,c)), 
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TABLE I: Parameter values used in the calculations of Fig. 2. 


high density phase (Figs. 2(b)), and maximal flux phase (Figs. 2(d)). Where the probability flux is defined by 
Ji = KEPi{l — Pi), which reaches its maximal value 1/4 when Pi = 1/2. Meanwhile, boundary layers may exist 
at one or both of the two boundaries, z = 0 and i = N. These properties are similar as the ones of usual TASEP 
models, and the probability and corresponding flux are fully determined by the transcription initiation rate a and 
the transcription termination rate (3, see [3, 34]. For general cases with nonzero values of backtracking rate kb and 
detachment rate kd, the plots in Figs. 2(e-h) show that, the probability Pi has a sharp change at the site /. For 
the sake of comparison, except kb and kd, other parameter values used in (e)-(h) are the same as the ones used in 
(a)-(d) respectively, see Table 1. Since polymerase at damaged site I may degrade and detach from the transcription 
template, different with the ones plotted in Figs. 2(a-d), the probability flux Ji = KEPi{3 — Pi) for the general cases 
is not conversed along the template. Meanwhile, since polymerase can only detach or backtrack from the damaged 
site I, the flux Ji is conversed both in the region between site z = 0 and site z = ^ — 1 and in the region between site 
z = Z + 1 and site i = N. Because of the detachment, the probability flux will be reduced after site 1. It means that 
for the cases of low density phase and maximal flux phase, the probability Pi will be reduced after site I (see Figs. 
2(e,g,h)), while for the cases of high density phase, Pi will be increased (see Figs. 2(f)). Due to the backtracking of 
polymerase from damaged site I, the total probability of finding polymerase at site I — 1, Pi-i, may be higher than 
those of other sites, see Figs. 2(g). 

The plots in Figs. 2(a-d) imply that, without polymerase detachment, the total transcription rate may not be 
reduced by the site damage. But the effective (or correct) transcription rate, i.e., the rate to synthesize correct 
mRNA, will be reduced. Given the nonzero bypass rate kbp, some products are incorrect and will degrade rapidly. 
With additional detachment of polymerase from the damaged site I, for general cases, the effective transcription rate 
is less than the effective transcription initial rate, see Figs. 2(e-h). In this study, the effective transcription initial rate 
is defined as cteff '■= a{l —po), the effective transcription rate is defined as /?e// := Ppn, and the bypass transcription 
rate is defined as Pbp := PqN- In the following, the parameter dependent properties of tte//, Peff, Ptp, and the 
ratio r := ffff/aeff will be discussed in detail. The ratio r is one reasonable index to describe the effectiveness of 
transcription including polymerase stalling. 

The plots in Fig. 3 show that with nonzero detachment rate kd, all the effective rates cteff, Peff, and /3bp decrease 
with the position I of damaged site, but the transcription effectiveness r increases with I (see the lines in Fig. 3 with 
marker “o”). Given that polymerase can only detach from the damaged site /, if the location / of damaged nucleotide 
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FIG. 2: Typical examples of probabilities Pi (solid lines), qi (dashed lines), Pi_i, p, and p (given in legends) along the gene, 
which are obtained from Eqs. (1-6) with gene length N = 100. (a-d) are for simplified cases where detachment and backtracking 
of polymerase from damaged site I are not allowed, i.e. kt, = kd = 0. While (e-h) are for cases with nonzero detachment rate 
and backtracking rate. Except the values of k^ and kd, other parameter values used in (a-d) are the same as the ones used in 
(e-h) respectively, see Table I. (a,c,e,g) are examples of the low probability density case with probability less than 0.5, (b,f) 
are examples of high density case with probability larger than 0.5, and (d,h) are examples of maximal flux case. The sharp 
decrease of probability Pi after site I — 100 is due to the polymerase detachment from site 1. For the meanings of probability 
notations, see Fig. 1 

is far from the initiation site 0, then the polymerase density between sites 0 and I will be high and consequently the 
polymerase current along gene will be low. Thus, the effective transcription initiation rate cteff is decreased with /. 
Except from the damaged site I, polymerase can not detach from the transcription template, therefore low effective 
initiation rate will lead to low transcription rate. Thus, effective transcription rate /3e// and bypass transcription rate 
Pbp also decrease with the damaged position 1. The increase of transcription effectiveness r with damaged position 
I implies that large values of damaged position I will be beneficial for cells to increase the transcription efficiency 
and save energy molecules. On the other hand, for non-detachment cases, i.e., kd = 0 (see the lines in Fig. 3 with 
marker “*”), the effective initiation rate cxeff is independent of damaged position I, but the effective transcription 
rate Peff increases with 1. Thus the transcription effectiveness r = Peff/aeff also increases with damaged position 1. 
Therefore, for any case (with or without detachment of polymerase from the damaged site), large values of damaged 
position I will help to increase the transcription efficiency. The plots in Fig. 3 also show that except for the cases 
where the damaged site of template is close to the transcription start site or termination site, Ueff, Peff, Pbp, and r 
are not sensitive to the damaged position 1. 

Fig. 4(a) shows that the effective transcription initiation rate cteff increases with the initiation rate a, and tends to 
approach one limit value. In the calculations of Fig. 4(a), the line with marker is obtained with large termination 
rate /3 and nonzero detachment rate kd, the line with marker “o” is obtained with small termination rate /3 and 
nonzero detachment rate kd, and the thick solid line is obtained with small termination rate /3 and zero detachment 
rate. Thus, the plots in Fig. 4(a) also imply that the initiation rate limit of the effective rate cteff increases with the 
termination rate /3 and detachment rate kd- For large initiation rate a, the effective transcription rate /3e// also has 
one limit value, which increases with termination rate /3 and detachment rate kd, see Fig. 4(b). But, different with 
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TABLE II: Parameter values used in the calculations of Figs. 3-10. 


the effective initiation rate cteff, Peff nray not change monotonically with initiation rate a. The plots in Fig. 4(c) 
show that the bypass transcription rate fbp increases with the initiation rate a, and tends to one limit value when 
a is large enough. The limit value of fbp increases with termination rate f but decreases with detachment rate kd- 
This is because, for large values of detachment rate kd, polymerase will have less chances to reach the stop site of the 
template. Finally, the transcription effectiveness r decreases with initiation rate a, and its limit value increases with 
both the termination rate (3 and detachment rate kd, see Fig. 4(d). 

Except the bypass transcription rate fibp, both of the two rates oceff and Peff, and the effectiveness r, increase 
monotonically with the termination rate /3, and tend to approach corresponding limit values for large /?, see Fig. 
5. The backtracking of stalled polymerase at damaged site I can help to raise the transcription effectiveness r (see 
the line with marker “o” in Fig. 5(d)). For high termination rate /?,/?> 0.5, backtracking also helps to raise the 
effective transcription rate f3eff (see the plots in Fig. 5(b)). Therefore, high termination rate and backtracking 
rate are beneficial to getting high effective transcription rate and to increasing the transcription effectiveness. With 
backtracking but no detachment, the stalled polymerase at damaged site I will have additional chance to continue its 
transcription. 

Without detachment, i.e., kd = 0, there are only two mechanisms for the stalled polymerase to leave damaged site 
I, backtracking to site I — 1 and waiting for the repair of site I or bypassing the damaged site I and continuing its 
transcription from site Z -|- 1. With the increase of backtracking rate kb, the translocation of polymerase along the 
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i i 

FIG. 3: The effective transcription initiation rate ae// := a(l — Po) (a), effective (or correct) transcription rate j3sff := Ppn 
(b), bypass transcription rate /3j,p := /Igjv (c), and the transcription effectiveness r := P^ff/cteff (d), as functions of the 
position I of damaged site, which changes from 10 to 190 with an increment 10. For other parameter values, see Table II. The 
only difference between the two lines in each figure is that, for the lines with marker the detachment rate fed is equal to 
zero, while for the lines with marker ‘o’, the detachment rate kd is nonzero. 



FIG. 4: The effective transcription initiation rate ce^ff (a), effective transcription rate /9e// (b), bypass transcription rate Pbp 
(c), and the effectiveness r := /3e///Q!e// (d), as functions of the transcription initiation rate a. In each figure, three typical 
examples are plotted. In calculations, the initiation rate a changes from 0.1 to 1 with an increment 0.1, and other parameter 
values are listed in Table II. In contrast to the lines with marker ‘o’, the thick solid lines are obtained with zero detachment 
rate, i.e., kd = 0, while the lines with marker ‘>i<’ are obtained with larger termination rate /3. 
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FIG. 5: The effective transcription initiation rate cteff (a), effective transcription rate /3e// (b), bypass transcription rate f5bp 
(c), and transcription effectiveness r (d), as functions of the termination rate /3. The lines with marker ‘o’ are obtained with 
zero detachment rate kd but nonzero backtracking rate kf,, while the lines with marker are obtained with nonzero detachment 
rate kd but zero backtracking rate ki,. In calculations, the termination rate {3 changes from 0.1 to 1 with an increment 0.1, for 
other parameter values, see Table II. 

template will be slowed down. Thus, the effective transcription initiation rate cteff decreases with backtracking rate 
kb, see Fig. 6(a). Given that there are only two mechanisms for the stalled polymerase to leave damaged site I, 
the increase of backtracking rate will lead to the decrease of the probability of bypass. This implies that, the bypass 
transcription rate decreases with backtracking rate kb, see Fig. 6(c). Finally, the plots in Figs 6(b,d) show that, both 
the effective transcription rate /3e// and the transcription effectiveness r increase with backtracking rate kb. Thus, 
backtracking is one of the ideal mechanisms for cells to solve the stalling problem. 

If the stalled polymerase can only continue its translocation by the bypass mechanism, i.e. bypass the damaged 
site I and continue its transcription from site / + 1, and cannot backtrack to site / + 1 or detach from the template, 
then the effective transcription initiation rate Oe//, the bypass transcription rate ffp, and the effective transcription 
rate /?e// will all increase with the bypass rate bbp, see Fig. 7(a-c). The increase of rate /3e// with bypass rate bbp 
is because, with large values of kbp, the polymerase with correctly synthesized mRNA will have less possibility to be 
blocked during its transcription process. However, the plots in Fig. 7(d) indicate that transcription effectiveness r 
decreases with bypass rate kbp. Besides the bypass mechanism, if the stalled polymerase can also backtrack to the 
previous site / — 1 to wait for the repair of the damaged site I, then the rates aeff,l3eff, and effectiveness r will be 
increased (see the lines with markers “o” and in Figs. 7(a,b,d)). The lines plotted in Figs. 7(c) with markers 
“o” and show that, with additional backtracking mechanism, i.e., kb ^ 0, the bypass transcription rate ffp will 
be reduced. Meanwhile, all the solid lines and the lines with marker “o” in Figs. 7(a-d) show that with additional 
detachment mechanism, i.e., kd 0, all the effective rates aeff,l3eff, and /3bp, and the transcription effectiveness r 
will be reduced. This implies that the detachment of stalled polymerase may not be one good mechanism for cells 
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FIG. 6: The effective transcription initiation rate cteff (a), effective transcription rate /3e// (b), bypass transcription rate f5bp 
(c), transcription effectiveness r (d), as functions of the backtracking rate In calculations, fc(, changes from 0 to 1 with 
an increment 0.1. The detachment rate kd is set to zero, i.e. stalled polymerases at damaged site I will not detach from the 
template. Other parameter values used in calculations are listed in Table II. 

to solve the transcription stalling problem, and to increase their transcription rate and efficiency. The plots in Fig 7 
also show that, for the special cases with either nonzero detachment rate or nonzero backtracking rate, the effective 
rates aef f, (3ef f, Pbp only change slightly with the bypass rate k^p. 

The plots in Fig. 8(a) show that, generally, the effective initiation rate Oe// increases with the detachment rate of 
stalled polymerase at damaged site 1. Because for large values of detachment rate, the polymerase density between site 
0 and site I will be low. Therefore, the effective initiation rate Oe// = o(l ~ Po) will be large. But for the cases with 
large termination rate ff cteff is almost independent of detachment rate kd, see the thick solid line in Fig. 8(a). This 
is because, for large termination rate /3, the polymerase density along transcription template is low enough, and the 
influence of detachment of stalled polymerase can be neglected. In other words, detachment will not help to reduce 
the polymerase density any longer. From the plots in Fig. 8(b), one can see that for low termination rate /?, the 
effective transcription rate ffff increases with detachment rate kd- The reason is that large detachment rate kd will be 
helpful to reduce the polymerase density along the transcript template. Consequently, the mean translocation speed 
of polymerase will be high. But for large values of termination rate /?, ffff decreases with detachment rate kd (see 
the thick solid line in Fig. 8(b)). Given large values of ff the polymerase density along template will be low enough 
such that each polymerase can translocate forward freely. Thus, with large values of detachment rate kd, polymerase 
will have less opportunity to complete its whole transcription process. This means that the effective transcription 
rate /3eff will be low for large detachment rate kd- Because there are altogether three possible mechanisms for stalled 
polymerase to leave the damaged site I, i.e., backtracking, detachment, and bypass, the bypass transcription rate (3bp 
will be low for large detachment rate kd, see Fig. 8(c). The plots in Fig. 8(d) show that, for the cases with nonzero 
backtracking rate kb, transcription effectiveness r decreases slightly with detachment rate kd- But for the cases with 
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FIG. 7: The effective transcription initiation rate cteff (a), effective transcription rate /3e// (b), bypass transcription rate f5bp 
(c), and the transcription effectiveness r (d), as functions of the bypass rate kbp. In each figure, three typical examples are 
plotted. Where the lines with marker ‘>i=’ are obtained with kd = 0 and kb yf 0, the thick solid lines are obtained with kd ^ 0 
and kb — 0, and the lines with marker ‘o’ are obtained with kd = kb = 0. The bypass rate kbp changes from 0 to 1 with an 
increment 0.1. The values of other parameters are listed in Table II. 

zero backtracking rate, effectiveness r increases with kd- This implies that when no backtracking, detachment is one 
good mechanism to solve the polymerase stalling problem. But generally, backtracking may be better than detachment 
at increasing the transcription efficiency. 

Figs. 9(a-c) show that all the effective rates, ae//j Peff, and Pbp increase with the return back rate kf of the 
backtracked polymerase. Since large value of rate kf means that the backtracked polymerase at site I — 1 will return 
back to site I quickly when the damaged site I has been repaired, and then restart its transcription. But the plots in 
Fig. 9(d) show that for nonzero detachment rate kd and low termination rate /?, transcription effectiveness decreases 
with rate kf. Given low termination rate /3, the polymerase density between sites I and N will be high, so the increase 
of return back rate kf has little influence to increase the effective transcription rate ffff (see the line in Fig. 9(b) 
with marker “*”). But for nonzero detachment rate kd, the polymerase translocation between sites 0 and I may be 
uncrowded, thus, the effective transcription initiation rate cte// increases with the return back rate kf (see the fine in 
Fig. 9(a) with marker “*”). Therefore, from the definition of transcription effectiveness, r := P^ff/cteff, for the cases 
with low termination rate /3 but nonzero detachment rate kd, transcription effectiveness r decreases with the return 
back rate kf. Therefore, large return back rate kf may not be helpful to increase the efficiency of transcription. 

Finally, the plots in Figs. lO(b-d) show that the effective transcription rate /?e// and transcription effectiveness r 
decrease with the damage rate kpd of site I, while the bypass transcription rate Pbp increases with kpd- For high damage 
rate kpd, the polymerase is more likely to be stalled at the site I, and then the possibility of bypass will be high and 
the synthesis speed of correct mRNA will be low. The plots in Fig. 10(a) imply that, for low termination rate (3 and 
low backtracking rate kb, the effective initiation rate Qfe// increases with damage rate kpd. Given low termination rate 
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FIG. 8: The effective transcription initiation rate Qe// (a), effective termination rate /3e// (b), bypass transcription rate fitp 
(c), and the transcription effectiveness r (d), as functions of the detachment rate kd- The lines with marker ‘o’ are for the 
cases with zero backtracking rate kt, and the thick solid lines are calculated with large transcription termination rate /3. In all 
calculations, the detachment rate kd changes from 0 to 1 with an increment 0.1. For other parameter values used in calculations, 
see Table II. 



FIG. 9: The effective transcription initiation rate cteff (a), effective termination rate fieff (b), bypass termination rate /3bp 
(c), and the transcription effectiveness r (d), as functions of the forward rate kj (see Fig. 1(a) for the meaning of kj). The 
lines with marker ‘o’ are obtained with zero detachment rate kd and high termination rate /3, while the lines with marker 
are obtained with nonzero detachment rate kd and low termination rate p. In all calculations, the forward rate kf changes 
from 0.1 to 1 with an increment 0.1. The values of other parameters are listed in Table II. 
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FIG. 10: The effective transcription initiation rate oteff (a), effective termination rate /3e// (b), bypass termination rate fitp 
(c), and the transcription effectiveness r (d), as functions of the damage rate k^d- The lines with marker ‘o’ are obtained with 
zero detachment rate kd, the lines with marker ‘>i=’ are obtained with zero bypass rate kbp, and the thick solid lines are obtained 
with zero backtracking rate kb and low termination rate /3. In all calculations, the damage rate kpd changes from 0 to 1 with 
an increment 0.1. The values of other parameters are listed in Table II. 

/3, the polymerase density along transcription template will be high and the translocation speed of polymerase will 
be low. With the increase of damage rate kpd, the stalled polymerase will have more possibility to detach from the 
template. So the total leaving rate of polymerase from the transcription template, either from the stop site N or from 
the damaged site I, will increase. Thus, the effective initiation rate cteff increases with the damage rate kpa- The line 
with marker “o” in Fig. 10(a) also show that, without detachment, the effective initiation rate cteff decreases with 
damage rate kpd- This is because, for large damage rate, polymerase will be more likely to be stalled at the damaged 
site I, and consequently, the translocation speed of polymerase along transcription template will be slowed down. 


IV. CONCLUDING REMARKS 

In this study, a modified TASEP model is presented to describe gene transcription process including polymerase 
stalling. Because of the detachment (or degradation) of polymerase from the damaged site, the polymerase density 
along transcription template has a sharp change at the damaged site. As in usual TASEP models, the polymerase 
density may have boundary layers at the transcription start site and termination site. In the main body of the 
transcription template, the polymerase density may be in three phases, low density phase, high density phase, and 
maximal flux phase. But the phases in different regions of the transcription template may be different. This study 
showed that the effective transcription rate (the rate to synthesize mRNA correctly), and the transcription effectiveness 
(the proportion of correct transcription) will be high if the damaged site of the transcription template is close to the 
transcription termination site. The increase of transcription initiation rate will increase the effective transcription 
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rate, but decrease the transcription effectiveness. On the other hand, the increase of transcription termination rate 
will increase the effective transcription rate and the transcription effectiveness, as well as the effective transcription 
initiation rate. 

Experiments found that there are three mechanisms for cells to solve the polymerase stalling problem, backtracking, 
bypass and detachment. This study showed that the increase of backtracking rate will lead to the increase of effective 
transcription rate and the transcription effectiveness, but lead to the decrease of the effective transcription initiation 
rate. It implies that backtracking is one of the ideal mechanisms to increase the synthesizing rate of mRNA and the 
transcription efficiency. Without backtracking and detachment, the increase of bypass rate will lead to the increase 
of effective transcription rate. But for general cases, large values of bypass rate will lead to low values of effective 
transcription rate and the transcription effectiveness. Similarly, without backtracking, detachment (or degradation) of 
the stalled polymerase is one good mechanism to solve the stalling problem. But for nonzero backtracking rate cases, 
the increase of detachment rate may lead to the decrease of effective transcription rate and transcription effectiveness. 
As expected, the increase of damage rate of the transcription template will lead to the decrease of transcription 
efficiency. 

The results obtained in this study will be helpful to the understandings of gene transcription in living cells, and the 
mechanisms used by cells to solve the polymerase stalling problem. The model presented in this study can be further 
generalized to discuss more general cases of gene transcription process in which polymerase may be stalled at more 
than one site of the transcription template. The model parameter values in real cells may be extracted through the 
NET-seq approach presented in [35] , and then the theoretical model given in this study can be used to do quantitative 
analysis of the gene transcription process with polymerase stalling. Einally, combining this study with the recent 
model presented by Choubey et al in [36], more details of the transcript process can be better understood. 
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